A Deep Dive into Self-Attention and Multi-Head Attention in Transformers
medium.com·18h·
Discuss: r/LocalLLaMA
🧠LLM Inference
EyesOff: I Built a Screen Contact Detection Model
ym2132.github.io·22h·
Discuss: Hacker News
📊Vector Databases
Generative AI and the P=NP problem
lesswrong.com·14h
🧮SMT Solvers
TSU 101: A New Type of Computing Hardware
extropic.ai·7h·
Discuss: Hacker News
🔢BitNet Inference
Attention really is all you need — The Encoder
pub.towardsai.net·1h
🧠LLM Inference
GNN From Scratch
cultured-avenue-f13.notion.site·7h·
Discuss: r/programming
🔢BitNet
ML Systems Textbook by Harvard
mlsysbook.ai·5h·
Discuss: Hacker News
🏗️LLM Infrastructure
Profiling Go Programs using Pprof and k6
pears.one·8h·
Discuss: r/golang
💾Prompt Caching
Software optimizes brain simulations, enabling them to complete complex cognitive tasks
medicalxpress.com·15h
🆕New AI
On Generative AI Imagery
xn--gckvb8fzb.com·19h
🎭Claude
Archimedes – A Python toolkit for hardware engineering
pinetreelabs.github.io·11h·
Discuss: Hacker News
🛠️Build Optimization
Reauthoring and Converting models for edge inference: MambaV2 on LiteRT
sachinjoglekar.substack.com·17h·
Discuss: Substack
🏗️LLM Infrastructure
AMD Enterprise AI Suite
enterprise-ai.docs.amd.com·22h·
Discuss: Hacker News
🖥GPUs
How to share Nvidia GPUs that don’t support MIG when vGPU isn’t an option
shambu.bearblog.dev·8h
🖥GPUs
Teaching AI to see the world more like we do
deepmind.google·3h·
Discuss: Hacker News
🔍AI Interpretability
Built a Mac app that makes local AI actually simple to use
suverenum.ai·7h·
Discuss: r/LocalLLaMA
🤖AI
Finding a CPU Design Bug in the Xbox 360 (2018)
randomascii.wordpress.com·5h·
⚙️Mechanical Sympathy
Whipping up a new Shell – Lash#Cat9 | Arcan
arcan-fe.com·2h
📟Terminals